Abstract. In [4], Diaconis used contour integration to give an asymptotic expansion of $\sum_{n\le x}\omega(n)$ up to an error of $O\left(\frac{x}{\log^k x}\right)$ for any integer $k$, where $\omega(n)=\sum_{p\mid n}1$ denotes the number of distinct prime factors function. In this article, we use the hyperbola method to provide a more precise formulation with an error term of the form $O\left(xe^{-c\sqrt{\log x}}\right)$, and deduce the expansion in [4] as a consequence. We also show that Diaconis' expansion follows from the earlier work of De Bruijn [2] on integers without large prime factors. Using this expansion, we then prove that if $M(x)$ denotes the median largest prime factor of the integers in the interval $[1,x]$, we have

\[ M(x) = e^{\frac{\gamma-1}{\sqrt{e}}}\,x^{\frac{1}{\sqrt{e}}}\left(1+O\left(\frac{1}{\log x}\right)\right), \]

where $\gamma$ is the Euler-Mascheroni constant. Our method is then applied in an identical way to obtain an asymptotic expansion for the average largest prime factor, $\sum_{n\le x}P(n)$, reproving results of [3], [5], with the advantage that the constants in the expansion are given explicitly, and an integral function is given with a more precise error term.



1. Introduction 

The number of distinct prime divisors function, $\omega(n)$, has been the subject of study for centuries. In 1917 Hardy and Ramanujan proved that $\omega(n)$ has normal order $\log\log n$, that is, $\omega(n)\sim\log\log n$ for almost all integers $n$. Turán reproved this result in an elementary way [7] by showing that both the mean and variance of $\omega(n)$ on $[1,x]$ are asymptotic to $\log\log x$. In 1976 Diaconis used contour integration to give an asymptotic expansion for the mean and variance of $\omega(n)$ [4], and in particular for the mean he proved that for any integer $k$,

\[ \sum_{n\le x}\omega(n) = x\log\log x + B_1x + c_1\frac{x}{\log x} + c_2\frac{x}{\log^2 x} + \cdots + c_k\frac{(k-1)!\,x}{\log^k x} + O\left(\frac{x}{\log^{k+1}x}\right) \tag{1.1} \]

for some computable constants $c_k$. In Section 2 we use the hyperbola method to prove the following theorem:

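Theorem 1. Letting $\mathrm{li}_f(x)=\int_2^x\frac{\{x/t\}}{\log t}\,dt$, we have that

\[ \sum_{n\le x}\omega(n) = x\log\log x + B_1x - \mathrm{li}_f(x) + O\left(xe^{-c\sqrt{\log x}}\right), \]

and for any integer $k$

\[ \mathrm{li}_f(x) = c_0\frac{x}{\log x} + c_1\frac{x}{\log^2 x} + \cdots + c_{k-1}\frac{(k-1)!\,x}{\log^k x} + O\left(\frac{x}{\log^{k+1}x}\right), \tag{1.2} \]

where $c_n = 1-\sum_{k=0}^{n}\frac{\gamma_k}{k!}$, $\gamma_k$ denotes the $k$-th Stieltjes constant, and $c_n\to\frac12$ as $n\to\infty$.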
From this theorem equation (1.1) follows as a corollary, and in Section 3 we use the results of [2] and [6] to provide an additional proof of this result. With this expansion, we prove the following regarding the median prime factor:

Theorem 2. If $M(x)$ is the median prime factor of the interval $[1,x]$, then

\[ M(x) = e^{\frac{\gamma-1}{\sqrt{e}}}\,x^{\frac{1}{\sqrt{e}}}\left(1+O\left(\frac{1}{\log x}\right)\right). \]

In Section 3, using Theorem 1, a more quantitative version of this theorem is given in terms of the function $\mathrm{li}_f(x)$ with an error term of the form $e^{-c\sqrt{\log x}}$.

In our final section, we examine $\sum_{n\le x}P(n)$, the average of the largest prime factor of $n$. This was first looked at by Erdős and Alladi in [1], and they proved that $\sum_{n\le x}P(n)\sim\frac{\pi^2}{12}\frac{x^2}{\log x}$. Later, in [3], De Koninck and Ivić showed that $\sum_{n\le x}P(n)$ has an asymptotic expansion with computable constants. Using the method of proof of Theorem 1 we are able to prove the following:

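Theorem 3. Letting $\mathrm{li}_g(x)=\frac{1}{x}\int_2^x\left[\frac{x}{t}\right]\frac{t}{\log t}\,dt$, we have that

\[ \sum_{n\le x}P(n) = x\,\mathrm{li}_g(x) + O\left(x^2e^{-c\sqrt{\log x}}\right), \]

and for any integer $k$

\[ \mathrm{li}_g(x) = d_0\frac{x}{\log x} + d_1\frac{x}{(\log x)^2} + \cdots + d_{k-1}\frac{(k-1)!\,x}{(\log x)^k} + O\left(\frac{x}{(\log x)^{k+1}}\right), \tag{1.3} \]

where $d_n=\sum_{j=0}^{n}\frac{(-1)^j\zeta^{(j)}(2)}{j!\,2^{n+1-j}}$, with $\zeta^{(j)}(2)$ denoting the $j$-th derivative of $\zeta(s)$ evaluated at $2$, and $d_n\to1$ as $n\to\infty$.

Notice in particular that $d_0=\frac{\pi^2}{12}$, so we deduce that

\[ \sum_{n\le x}P(n) = \frac{\pi^2}{12}\frac{x^2}{\log x} + O\left(\frac{x^2}{(\log x)^2}\right). \]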

Although this expansion has previously been presented in [3], and again in [5], there are some advantages to the result above. We give an explicit form for the constants in terms of the zeta function, and our proof is elementary, requiring only one application of the hyperbola method. We also give the explicit integral form, which has an error term like that of the prime number theorem, mimicking how $\pi(x)$ is approximated by the function $\mathrm{li}(x)$.

It is worth mentioning why the notation $\mathrm{li}_f(x)$, $\mathrm{li}_g(x)$ is used for the above integral functions. This is because not only do they mimic $\mathrm{li}(x)=\int_2^x\frac{dt}{\log t}$ in terms of their definition and the error terms which appear, but also because they have a nearly identical asymptotic expansion. Recall that for $\mathrm{li}(x)=\int_2^x\frac{dt}{\log t}$, by integration by parts we have that

\[ \mathrm{li}(x) = \frac{x}{\log x} + \frac{x}{\log^2 x} + \frac{2!\,x}{\log^3 x} + \cdots + \frac{(k-1)!\,x}{\log^k x} + O\left(\frac{x}{\log^{k+1}x}\right). \]

This is very similar to both equations (1.2) and (1.3), and the similarities are made clearer by the fact that as $n\to\infty$,

\[ c_n\to\tfrac12 \quad\text{and}\quad d_n\to1. \]



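2. The Mean Value of $\omega(n)$ and $\Omega(n)$.

Our goal is to find a more precise asymptotic formula for $\sum_{n\le x}\omega(n)$ and $\sum_{n\le x}\Omega(n)$. We start by looking at the sum of $\omega(n)$. Since $\omega(n)=\sum_{p\mid n}1$, by rearranging, we have that

\[ \sum_{n\le x}\omega(n) = \sum_{p\le x}\left[\frac{x}{p}\right] = x\sum_{p\le x}\frac{1}{p} - \sum_{p\le x}\left\{\frac{x}{p}\right\}. \]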
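The rearrangement $\sum_{n\le x}\omega(n)=\sum_{p\le x}[x/p]$ is an exact identity, so it can be checked directly for small $x$. The following is a minimal sketch; the function names `omega_and_primes` and `check_rearrangement` are illustrative and do not come from the paper.

```python
def omega_and_primes(x):
    """Return (omega, primes): omega[n] counts the distinct primes dividing n, 0 <= n <= x."""
    omega = [0] * (x + 1)
    primes = []
    for p in range(2, x + 1):
        if omega[p] == 0:  # no smaller prime divides p, so p is prime
            primes.append(p)
            for m in range(p, x + 1, p):
                omega[m] += 1
    return omega, primes


def check_rearrangement(x):
    """Return both sides of sum_{n<=x} omega(n) = sum_{p<=x} floor(x/p)."""
    omega, primes = omega_and_primes(x)
    lhs = sum(omega[1:])
    rhs = sum(x // p for p in primes)
    return lhs, rhs


if __name__ == "__main__":
    print(check_rearrangement(10))
```

Each prime $p$ contributes one unit to $\omega(n)$ for every multiple $n\le x$ of $p$, and there are $[x/p]$ such multiples, which is all the sieve loop is doing.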



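The prime number theorem tells us that

\[ \sum_{p\le x}\frac{1}{p} = \log\log x + B_1 + O\left(e^{-c\sqrt{\log x}}\right), \tag{2.1} \]

where $B_1=\gamma-\sum_p\sum_{k\ge2}\frac{1}{kp^k}$ is Mertens' constant, so we conclude that

\[ \sum_{n\le x}\omega(n) = x\log\log x + B_1x - \sum_{p\le x}\left\{\frac{x}{p}\right\} + O\left(xe^{-c\sqrt{\log x}}\right). \]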
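Estimate (2.1) can be illustrated numerically. The sketch below compares $\sum_{p\le x}1/p$ with $\log\log x+B_1$; the decimal value used for $B_1$ is the standard published value of the Meissel-Mertens constant, and the helper names are illustrative assumptions rather than anything from the paper.

```python
import math

MERTENS_B1 = 0.2614972128476428  # Meissel-Mertens constant (standard decimal value)


def primes_up_to(x):
    """Sieve of Eratosthenes returning all primes p <= x (x >= 1)."""
    sieve = bytearray([1]) * (x + 1)
    sieve[0:2] = b"\x00\x00"
    for p in range(2, math.isqrt(x) + 1):
        if sieve[p]:
            # strike out the multiples p*p, p*p + p, ... of p
            sieve[p * p :: p] = bytearray(len(range(p * p, x + 1, p)))
    return [n for n in range(2, x + 1) if sieve[n]]


def mertens_gap(x):
    """Return sum_{p<=x} 1/p - (log log x + B_1)."""
    s = sum(1.0 / p for p in primes_up_to(x))
    return s - (math.log(math.log(x)) + MERTENS_B1)


if __name__ == "__main__":
    for x in (10**3, 10**4, 10**5):
        print(x, mertens_gap(x))
```

The printed gaps shrink as $x$ grows, which is consistent with, though of course no proof of, the error term in (2.1).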

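From here we need only evaluate the sum of fractional parts over the primes. Since the primes have density $\frac{1}{\log x}$ we expect that $\sum_{p\le x}\left\{\frac{x}{p}\right\}\sim\int_2^x\left\{\frac{x}{t}\right\}\frac{dt}{\log t}$, and this is indeed the case. The error term is managed by using the hyperbola method along with the prime number theorem, and we have the following proposition:

Proposition 4. Letting $\mathrm{li}_f(x):=\int_2^x\frac{\{x/t\}}{\log t}\,dt$, we have that

\[ \sum_{p\le x}\left\{\frac{x}{p}\right\} = \mathrm{li}_f(x) + O\left(xe^{-c\sqrt{\log x}}\right). \]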

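Proof. Let $1<B<x$ be some integer. Splitting into intervals and rearranging the order, we have that

\[ \sum_{x/B<p\le x}\left[\frac{x}{p}\right] = \sum_{n\le B-1}n\left(\pi\left(\frac{x}{n}\right)-\pi\left(\frac{x}{n+1}\right)\right) = \sum_{n\le B-1}\pi\left(\frac{x}{n}\right) - (B-1)\,\pi\left(\frac{x}{B}\right). \tag{2.2} \]

By the prime number theorem this is

\[ \int_{x/B}^{x}\left[\frac{x}{t}\right]\frac{dt}{\log t} + O\left(\sum_{n\le B}\frac{x}{n}\,e^{-c\sqrt{\log(x/n)}}\right). \]

Splitting up the floor function, the main term is

\[ \int_{x/B}^{x}\left[\frac{x}{t}\right]\frac{dt}{\log t} = x\left(\log\log x-\log\log\left(\frac{x}{B}\right)\right) - \int_{x/B}^{x}\frac{\{x/t\}}{\log t}\,dt, \]

and hence, since $\sum_{x/B<p\le x}\frac{1}{p}=\log\log x-\log\log\left(\frac{x}{B}\right)+O\left(e^{-c\sqrt{\log(x/B)}}\right)$ by (2.1), we have that

\[ \sum_{x/B<p\le x}\left\{\frac{x}{p}\right\} = \int_{x/B}^{x}\frac{\{x/t\}}{\log t}\,dt + O\left(xe^{-c\sqrt{\log(x/B)}}\log B\right). \]

The result then follows by choosing $B=[\sqrt{x}]$ and noting that we can extend the sum and integral to start at $2$, as $\int_2^{x/B}\frac{\{x/t\}}{\log t}\,dt=O(\sqrt{x})$ and $\sum_{p\le x/B}\left\{\frac{x}{p}\right\}=O(\sqrt{x})$. □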

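Corollary 5. We have that

\[ \sum_{n\le x}\omega(n) = x\log\log x + B_1x - \mathrm{li}_f(x) + O\left(xe^{-c\sqrt{\log x}}\right) \]

and

\[ \sum_{n\le x}\Omega(n) = x\log\log x + B_2x - \mathrm{li}_f(x) + O\left(xe^{-c\sqrt{\log x}}\right), \]

where

\[ B_2 = B_1 + \sum_p\frac{1}{p(p-1)}. \]

Proof. The first identity is immediate from what has been done thus far. As $\sum_{n\le x}\Omega(n)=\sum_{p^k\le x}\left[\frac{x}{p^k}\right]$, we note that the difference is given explicitly by

\[ \sum_{n\le x}\left(\Omega(n)-\omega(n)\right) = \sum_{p\le x}\sum_{k\ge2}\left[\frac{x}{p^k}\right] = x\sum_{2\le k\le\log_2(x)}\ \sum_{p\le\sqrt[k]{x}}\frac{1}{p^k} - \sum_{2\le k\le\log_2(x)}\ \sum_{p\le\sqrt[k]{x}}\left\{\frac{x}{p^k}\right\}. \]

Extending the sums to $\infty$ and rearranging, we have

\[ \sum_{n\le x}\left(\Omega(n)-\omega(n)\right) = x\sum_p\sum_{k\ge2}\frac{1}{p^k} + O\left(\sqrt{x}\log x\right) = x\sum_p\frac{1}{p(p-1)} + O\left(\sqrt{x}\log x\right), \]

which implies the result. □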



2.1. The Function $\mathrm{li}_f(x)$. In this subsection we provide an asymptotic expansion for $\mathrm{li}_f(x)$. We will make use of the Laurent expansion of $\zeta(s)$, which is given by

\[ \zeta(s) = \frac{1}{s-1} + \sum_{n=0}^{\infty}\frac{(-1)^n\gamma_n}{n!}(s-1)^n, \tag{2.3} \]

where the $\gamma_n$ are the Stieltjes constants, and $\gamma_0$ is the Euler-Mascheroni constant. The expansion for $\mathrm{li}_f(x)$ can then be expressed in terms of these constants.

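Proposition 6. For any integer $k>0$ we have

\[ \mathrm{li}_f(x) = c_0\frac{x}{\log x} + c_1\frac{x}{\log^2 x} + c_2\frac{2!\,x}{\log^3 x} + \cdots + c_{k-1}\frac{(k-1)!\,x}{\log^k x} + O_k\left(\frac{x}{\log^{k+1}x}\right), \tag{2.4} \]

where

\[ c_n = 1-\sum_{k=0}^{n}\frac{\gamma_k}{k!}. \]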



By using the fact that $\zeta(0)=-\frac12$ along with Abel's Limit Theorem we find that $\sum_{k=0}^{\infty}\frac{\gamma_k}{k!}=\frac12$, since this series is absolutely convergent. This tells us that as $n\to\infty$, $c_n\to\frac12$, which is roughly what we might expect since $\int_1^x\{t\}\,dt\sim\frac{x}{2}$. From Proposition 6 along with Corollary 5 we are able to deduce Theorem 1. To prove this proposition recall that for all $s\ne1$, $\mathrm{Re}(s)>0$, we have

\[ \zeta(s) = 1 + \frac{1}{s-1} - s\int_1^{\infty}\{x\}x^{-s-1}\,dx, \tag{2.5} \]

which allows us to prove the following lemma: 



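Lemma 7. For any integer $n\ge0$ we have

\[ \int_1^{\infty}\frac{\{u\}}{u^2}(\log u)^n\,du = n!\left(1-\sum_{k=0}^{n}\frac{\gamma_k}{k!}\right) = n!\,c_n. \]

Proof. Let $b_n=\int_1^{\infty}\frac{\{u\}}{u^2}(\log u)^n\,du$. Then to find each $b_n$, consider the generating series

\[ \sum_{n=0}^{\infty}\frac{b_n}{n!}z^n = \int_1^{\infty}\frac{\{u\}}{u^2}\left(\sum_{n=0}^{\infty}\frac{(z\log u)^n}{n!}\right)du = \int_1^{\infty}\{u\}u^{z-2}\,du. \]

By (2.5) with $s=1-z$ this equals $-\frac{1}{z}-\frac{\zeta(1-z)}{1-z}$. Using (2.3) we then have

\[ \sum_{n=0}^{\infty}\frac{b_n}{n!}z^n = \left(\sum_{m\ge0}z^m\right)\left(1-\sum_{n\ge0}\frac{\gamma_n}{n!}z^n\right), \]

and upon comparing coefficients, the result follows. □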
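As a sanity check on the constants $c_n=1-\sum_{k\le n}\gamma_k/k!$, the sketch below tabulates the first few values and compares $c_0$ with a direct evaluation of $\int_1^N\{u\}u^{-2}\,du$. The decimal values of the Stieltjes constants are the standard published ones, hard-coded here as an assumption; the function names are illustrative and not from the paper.

```python
import math

# Stieltjes constants gamma_0, ..., gamma_5 (standard published decimal values).
STIELTJES = [
    0.5772156649015329,
    -0.0728158454836767,
    -0.0096903631928723,
    0.0020538344203034,
    0.0023253700654673,
    0.0007933238173011,
]


def c_coefficients(stieltjes=STIELTJES):
    """Return [c_0, c_1, ...] with c_n = 1 - sum_{k<=n} gamma_k / k!."""
    out, partial = [], 0.0
    for k, g in enumerate(stieltjes):
        partial += g / math.factorial(k)
        out.append(1.0 - partial)
    return out


def b0_truncated(N):
    """Exact value of int_1^N {u}/u^2 du for an integer N >= 1.

    On [m, m+1] the integrand is (u - m)/u^2, whose integral is
    log((m+1)/m) - 1/(m+1); summing over m = 1..N-1 gives log N - (H_N - 1).
    """
    harmonic = sum(1.0 / m for m in range(1, N + 1))
    return math.log(N) - harmonic + 1.0


if __name__ == "__main__":
    print(c_coefficients())
    print(b0_truncated(10**5))
```

With these inputs the values $c_n$ approach $\frac12$ quickly, and the truncated integral agrees with $c_0=1-\gamma$ up to the tail $\int_N^\infty\{u\}u^{-2}\,du$, which is at most $1/N$.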
The proposition now follows from some basic substitutions: 
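Proof. In the equation

\[ \mathrm{li}_f(x) = \int_{\sqrt{x}}^{x}\frac{\{x/t\}}{\log t}\,dt + O\left(\sqrt{x}\right) \tag{2.6} \]

let $u=x/t$. This allows us to rewrite the integral as

\[ \int_{\sqrt{x}}^{x}\frac{\{x/t\}}{\log t}\,dt = x\int_1^{\sqrt{x}}\frac{\{u\}}{u^2\log(x/u)}\,du = \frac{x}{\log x}\int_1^{\sqrt{x}}\frac{\{u\}}{u^2}\left(1-\frac{\log u}{\log x}\right)^{-1}du. \]

Let $k>0$ be some integer, and expand the geometric series

\[ \left(1-\frac{\log u}{\log x}\right)^{-1} = 1 + \frac{\log u}{\log x} + \left(\frac{\log u}{\log x}\right)^2 + \cdots + \left(\frac{\log u}{\log x}\right)^{k-1} + \left(\frac{\log u}{\log x}\right)^{k}\left(1-\frac{\log u}{\log x}\right)^{-1}, \]

so that

\[ \int_{\sqrt{x}}^{x}\frac{\{x/t\}}{\log t}\,dt = \sum_{j=0}^{k-1}\frac{x}{(\log x)^{j+1}}\int_1^{\sqrt{x}}\frac{\{u\}}{u^2}(\log u)^j\,du + \frac{x}{(\log x)^{k+1}}\int_1^{\sqrt{x}}\frac{\{u\}}{u^2}(\log u)^k\left(1-\frac{\log u}{\log x}\right)^{-1}du. \]

Since $\log u\le\frac12\log x$, it follows that

\[ \left(1-\frac{\log u}{\log x}\right)^{-1}\le2, \]

so the last term contributes an error of the form $O_k\left(\frac{x}{\log^{k+1}x}\right)$. Since we may bound the integral

\[ \int_{\sqrt{x}}^{\infty}\frac{\{u\}}{u^2}(\log u)^n\,du \le \int_{\sqrt{x}}^{\infty}\frac{(\log u)^n}{u^2}\,du = O_n\left(\frac{(\log x)^n}{\sqrt{x}}\right), \]

it follows that

\[ \int_{\sqrt{x}}^{x}\frac{\{x/t\}}{\log t}\,dt = \frac{x}{\log x}\int_1^{\infty}\frac{\{u\}}{u^2}\,du + \frac{x}{\log^2 x}\int_1^{\infty}\frac{\{u\}}{u^2}\log u\,du + \cdots + \frac{x}{(\log x)^k}\int_1^{\infty}\frac{\{u\}}{u^2}(\log u)^{k-1}\,du + O_k\left(\frac{x}{\log^{k+1}x}\right). \tag{2.7} \]

The stated result then follows by combining equations (2.6) and (2.7) along with Lemma 7. □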



3. The Median Largest Prime Factor 

Let $\psi(x,y)$ denote the number of integers $n$ with $1\le n\le x$, all of whose prime factors are $\le y$. Then the median prime factor $M(x)$ for the integers in the interval $[1,x]$ satisfies

\[ \psi\left(x,M(x)\right) = \tfrac12x. \]

Our main result is the following:

Theorem 8. We have that

\[ M(x) = x^{\frac{1}{\sqrt{e}}\exp\left(-\frac{\mathrm{li}_f(x)}{x}\right)} + O\left(x^{\frac{1}{\sqrt{e}}}e^{-c\sqrt{\log x}}\right). \]

Before proving Theorem 8 we show that Theorem 2 follows from the above along with Proposition 6, the asymptotic expansion for $\mathrm{li}_f(x)$.



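Proof. Since

\[ \frac{\mathrm{li}_f(x)}{x} = \frac{1-\gamma}{\log x} + O\left(\frac{1}{\log^2 x}\right), \]

by using the series expansion for $e^x$ we see that

\[ \exp\left(-\frac{\mathrm{li}_f(x)}{x}\right) = 1 - \frac{1-\gamma}{\log x} + O\left(\frac{1}{\log^2 x}\right). \]

From here, the series expansion for $e^x$ applied once again tells us that

\[ x^{\frac{1}{\sqrt{e}}\exp\left(-\frac{\mathrm{li}_f(x)}{x}\right)} = e^{\frac{\gamma-1}{\sqrt{e}}}\,x^{\frac{1}{\sqrt{e}}}\left(1+O\left(\frac{1}{\log x}\right)\right), \tag{3.1} \]

proving the desired result. □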
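For small $x$ the median largest prime factor can be computed by brute force and set beside the leading term $e^{\frac{\gamma-1}{\sqrt e}}x^{\frac{1}{\sqrt e}}$. The sketch below is only illustrative: it takes $P(1)=1$ and defines the median as the smallest $y$ with $\psi(x,y)\ge x/2$, both of which are conventions assumed here rather than taken from the paper, and the function names are hypothetical.

```python
import math

EULER_GAMMA = 0.5772156649015329


def largest_prime_factors(x):
    """Return lpf with lpf[n] = largest prime factor of n (2 <= n <= x), lpf[1] = 1."""
    lpf = [0] * (x + 1)
    lpf[1] = 1  # convention assumed for this sketch
    for p in range(2, x + 1):
        if lpf[p] == 0:  # p is prime
            for m in range(p, x + 1, p):
                lpf[m] = p  # larger primes overwrite smaller ones
    return lpf


def median_largest_prime_factor(x):
    """Smallest y such that at least x/2 of the integers n <= x have P(n) <= y."""
    values = sorted(largest_prime_factors(x)[1:])
    k = math.ceil(x / 2)
    return values[k - 1]


def leading_term(x):
    """Main term e^{(gamma-1)/sqrt(e)} * x^{1/sqrt(e)} from Theorem 2."""
    r = 1.0 / math.sqrt(math.e)
    return math.exp((EULER_GAMMA - 1.0) * r) * x**r


if __name__ == "__main__":
    for x in (10**3, 10**4, 10**5, 10**6):
        print(x, median_largest_prime_factor(x), round(leading_term(x), 1))
```

Since the relative error in Theorem 2 is only $O(1/\log x)$, close agreement should not be expected at these sizes; the comparison is meant to show the order of magnitude.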

Recall that $\psi(x,y)$ denotes the number of integers $n$ with $1\le n\le x$, all of whose prime factors are $\le y$. If $x^{1/2}\le y\le x$, then $n\le x$ implies that at most one prime $p>y$ can divide $n$, so we have that

\[ \psi(x,y) = [x] - \sum_{y<p\le x}\ \sum_{\substack{n\le x\\ p\mid n}}1 = [x] - \sum_{y<p\le x}\left[\frac{x}{p}\right]. \]

Since

\[ \sum_{p\le y}\left[\frac{x}{p}\right] = x\sum_{p\le y}\frac{1}{p} - \sum_{p\le y}\left\{\frac{x}{p}\right\}, \]

we see that

\[ \psi(x,y) = x - \sum_{n\le x}\omega(n) + x\sum_{p\le y}\frac{1}{p} + O\left(\pi(y)\right), \tag{3.2} \]

so that by Theorem 1 together with (2.1) we have

\[ \psi(x,y) = x\left(1-\log\left(\frac{\log x}{\log y}\right)\right) + \mathrm{li}_f(x) + O\left(\pi(y)+xe^{-c\sqrt{\log x}}\right). \tag{3.3} \]

In the next subsection, we will use equation (3.2) and a result of De Bruijn to provide an alternate derivation for the asymptotic expansion of $\sum_{n\le x}\omega(n)$ given by Diaconis. With equation (3.3) we are now ready to prove Theorem 8.



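Proof. Let $M(x)$ be the median prime factor for the interval $[1,x]$ so that

\[ \psi\left(x,M(x)\right) = \tfrac12x. \]

Then by equation (3.3) we must have that

\[ \frac{x}{2} = x\left(1-\log\left(\frac{\log x}{\log M(x)}\right)\right) + \mathrm{li}_f(x) + O\left(xe^{-c\sqrt{\log x}}\right), \]

and hence

\[ \log\left(\frac{\log M(x)}{\log x}\right) = -\frac12 - \frac{\mathrm{li}_f(x)}{x} + O\left(e^{-c\sqrt{\log x}}\right). \]

Taking the exponential of both sides, and noting that $\exp\left(O\left(e^{-c\sqrt{\log x}}\right)\right)=1+O\left(e^{-c\sqrt{\log x}}\right)$, we find that

\[ \frac{\sqrt{e}\,\log M(x)}{\log x} = \exp\left(-\frac{\mathrm{li}_f(x)}{x}\right) + O\left(e^{-c\sqrt{\log x}}\right). \]

Multiplying through by $\frac{\log x}{\sqrt{e}}$ and taking the exponential once again, we find that

\[ M(x) = x^{\frac{1}{\sqrt{e}}\exp\left(-\frac{\mathrm{li}_f(x)}{x}\right)} + O\left(x^{\frac{1}{\sqrt{e}}}e^{-c\sqrt{\log x}}\right), \]

proving the result. □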

3.1. Integers without large prime factors: De Bruijn's result. In [2], De Bruijn showed that

\[ \psi(x,y) = \Lambda(x,y) + O\left(xe^{-c\sqrt{\log y}}\right), \]

where

\[ \Lambda(x,y) = x\int_0^{\infty}\rho\left(\frac{\log x-\log t}{\log y}\right)d\left(\frac{[t]}{t}\right) \]

and $\rho(u)$ denotes the Dickman-De Bruijn rho function. The error term in the above was later improved by Saias [6]. In his paper, De Bruijn also gave an asymptotic expansion for $\Lambda(x,y)$ in terms of computable constants; however, we will use the version appearing in Saias' paper as it is easier to work with. Specifically, Saias shows that if $u=\frac{\log x}{\log y}$ so that $x=y^u$, and $u\notin\mathbb{N}$, $u<(\log y)^{\frac35}$, then we have

\[ \Lambda(x,y) = x\sum_{k=0}^{n}a_k\frac{\rho^{(k)}(u)}{(\log y)^k} + O_{n,\varepsilon}\left(\frac{x}{(\log y)^{n+1}}\right), \tag{3.4} \]

where

\[ a_0=1,\qquad a_k = \frac{(-1)^k}{(k-1)!}\int_1^{\infty}\frac{\{t\}}{t^2}(\log t)^{k-1}\,dt. \]

In what follows, we will have no use for the range on $u$, as what we are interested in is when $1<u<2$. On this range, we have that $\rho(u)=1-\log u$ so that $\rho'(u)=-\frac{1}{u}$ and

\[ \rho^{(k)}(u) = \frac{(-1)^k(k-1)!}{u^k}. \]

We note that, by Lemma 7, for $k\ge1$

\[ a_k = (-1)^kc_{k-1} = (-1)^k\left(1-\sum_{i=0}^{k-1}\frac{\gamma_i}{i!}\right). \]

Since $\frac{1}{u^k(\log y)^k}=\frac{1}{(\log x)^k}$, equation (3.4) becomes

\[ \Lambda(x,y) = x\rho(u) + x\sum_{k=1}^{n}\frac{(k-1)!\,c_{k-1}}{(\log x)^k} + O_n\left(\frac{x}{(\log x)^{n+1}}\right), \tag{3.5} \]

and hence, using the fact that for $1<u<2$ we have $\rho(u)=1-\log u=1-\log\log x+\log\log y$, and by (3.2), we conclude that

\[ \sum_{n\le x}\omega(n) = x\log\log x + B_1x - x\sum_{k=0}^{n-1}\frac{k!\,c_k}{(\log x)^{k+1}} + O_n\left(\frac{x}{(\log x)^{n+1}}\right). \]

This reproves (1.1), Diaconis' expansion.



Remark 9. Rather than providing the elementary derivation of Corollary 5 and using this to prove Theorem 8, we could have proceeded directly from the results of Saias and De Bruijn, using equation (3.5) to prove Theorem 2. One of the goals the author had in writing this was to show how all of these things are interrelated and equivalent.



4. The Average Largest Prime Factor 



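In this section, following the proof of Theorem 1 very closely, we will deduce Theorem 3. First, note that each integer $n\le x$ may have at most one prime $p>\sqrt{x}$ such that $p\mid n$, and for each prime $p$, there are $\left[\frac{x}{p}\right]$ integers divisible by $p$ which are at most $x$. Combining these two facts, we see that

\[ \sum_{\sqrt{x}<p\le x}p\left[\frac{x}{p}\right] \le \sum_{n\le x}P(n) \le \sum_{p\le x}p\left[\frac{x}{p}\right], \]

and since $\sum_{p\le\sqrt{x}}p\left[\frac{x}{p}\right]\le x\sum_{p\le\sqrt{x}}1=O\left(\frac{x^{3/2}}{\log x}\right)$, it follows that

\[ \sum_{n\le x}P(n) = \sum_{\sqrt{x}<p\le x}p\left[\frac{x}{p}\right] + O\left(\frac{x^{3/2}}{\log x}\right). \]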
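The counting step behind this estimate is exact: the integers $n\le x$ whose largest prime factor exceeds $\sqrt{x}$ contribute precisely $\sum_{\sqrt{x}<p\le x}p\,[x/p]$ to $\sum_{n\le x}P(n)$. A minimal sketch checking this for small $x$ follows; the sum is taken over $2\le n\le x$, and the function names are illustrative rather than taken from the paper.

```python
def largest_prime_factors(x):
    """Return lpf with lpf[n] = largest prime factor of n for 2 <= n <= x."""
    lpf = [0] * (x + 1)
    for p in range(2, x + 1):
        if lpf[p] == 0:  # p is prime
            for m in range(p, x + 1, p):
                lpf[m] = p  # larger primes overwrite smaller ones
    return lpf


def split_sum(x):
    """Return (total, big_part, by_primes) for the sum of P(n), 2 <= n <= x.

    total     : sum of P(n) over all 2 <= n <= x
    big_part  : the same sum restricted to n with P(n) > sqrt(x)
    by_primes : sum of p * floor(x / p) over primes sqrt(x) < p <= x
    """
    lpf = largest_prime_factors(x)
    total = sum(lpf[2:])
    big_part = sum(q for q in lpf[2:] if q * q > x)
    by_primes = sum(
        p * (x // p) for p in range(2, x + 1) if lpf[p] == p and p * p > x
    )
    return total, big_part, by_primes


if __name__ == "__main__":
    print(split_sum(10))
    print(split_sum(10**4))
```

The difference `total - by_primes` is the contribution of the integers with $P(n)\le\sqrt{x}$, which is trivially at most $x^{3/2}$; the sharper $O\left(x^{3/2}/\log x\right)$ bound is the one stated in the text.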



Define $\kappa(x)=\sum_{p\le x}p$, so that by the prime number theorem along with partial summation, we have the estimate

\[ \kappa(x) = \int_2^x\frac{t}{\log t}\,dt + O\left(x^2e^{-c\sqrt{\log x}}\right). \tag{4.1} \]

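Letting $[\sqrt{x}]=K$, and following equation (2.2), the above sum is then

\[ \sum_{K<p\le x}p\left[\frac{x}{p}\right] = \sum_{n\le K-1}n\left(\kappa\left(\frac{x}{n}\right)-\kappa\left(\frac{x}{n+1}\right)\right) + O(x) = \sum_{n\le K-1}\kappa\left(\frac{x}{n}\right) - (K-1)\,\kappa\left(\frac{x}{K}\right) + O(x), \]

where the $O(x)$ accounts for the boundedly many primes in $(K,x/K]$, and hence by (4.1)

\[ \sum_{K<p\le x}p\left[\frac{x}{p}\right] = \int_{\sqrt{x}}^{x}\left[\frac{x}{t}\right]\frac{t}{\log t}\,dt + O\left(x^2e^{-c\sqrt{\log x}}\right) \tag{4.2} \]

\[ = x\,\mathrm{li}_g(x) + O\left(x^2e^{-c\sqrt{\log x}}\right), \tag{4.3} \]

where the last equality follows since $\int_2^{\sqrt{x}}\left[\frac{x}{t}\right]\frac{t}{\log t}\,dt=O\left(\frac{x^{3/2}}{\log x}\right)$. Turning our attention to this integral function, we make the substitution $t=\frac{x}{u}$ to get

\[ \int_{\sqrt{x}}^{x}\left[\frac{x}{t}\right]\frac{t}{\log t}\,dt = x^2\int_1^{\sqrt{x}}\frac{[u]}{u^3\log(x/u)}\,du. \tag{4.4} \]

Setting $d_k=\frac{1}{k!}\int_1^{\infty}\frac{[u]}{u^3}(\log u)^k\,du$, and by following a proof identical to that of Proposition 6, we have that

\[ \int_{\sqrt{x}}^{x}\left[\frac{x}{t}\right]\frac{t}{\log t}\,dt = d_0\frac{x^2}{\log x} + d_1\frac{x^2}{(\log x)^2} + \cdots + d_{k-1}\frac{(k-1)!\,x^2}{(\log x)^k} + O\left(\frac{x^2}{(\log x)^{k+1}}\right). \tag{4.5} \]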

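Similar to equation (2.5), integration by parts tells us that for $s>1$

\[ \zeta(s) = \sum_{n=1}^{\infty}n^{-s} = \int_{1^-}^{\infty}x^{-s}\,d[x] = s\int_1^{\infty}[x]x^{-s-1}\,dx, \]

so that exponential generating series yield

\[ \sum_{k=0}^{\infty}d_kz^k = \int_1^{\infty}[x]x^{z-3}\,dx = \frac{\zeta(2-z)}{2-z}. \]

Using the Taylor series for $\zeta(2-z)$, this is

\[ \left(\sum_{j=0}^{\infty}\frac{(-1)^j\zeta^{(j)}(2)}{j!}z^j\right)\left(\sum_{k=0}^{\infty}\frac{z^k}{2^{k+1}}\right) = \sum_{n=0}^{\infty}z^n\sum_{k+j=n}\frac{(-1)^j\zeta^{(j)}(2)}{j!\,2^{k+1}}, \]

so we see that

\[ d_n = \sum_{j=0}^{n}\frac{(-1)^j\zeta^{(j)}(2)}{j!\,2^{n+1-j}}, \]

which establishes Theorem 3. Lastly, we note that

\[ \frac{(-1)^j\zeta^{(j)}(2)}{j!} = \frac{1}{j!}\sum_{k=1}^{\infty}\frac{(\log k)^j}{k^2} \approx \frac{1}{j!}\int_1^{\infty}\frac{(\log x)^j}{x^2}\,dx. \]

A substitution shows that this last integral is the Gamma function, and that it evaluates to exactly $j!$. This allows us to prove that

\[ \frac{(-1)^j\zeta^{(j)}(2)}{j!} - 1 \to 0 \]

as $j\to\infty$, and from this we deduce that as $n\to\infty$

\[ d_n = \sum_{j=0}^{n}\frac{(-1)^j\zeta^{(j)}(2)}{j!\,2^{n+1-j}} \to 1. \]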